AITopics | unpaired image-to-image translation

Collaborating Authors

unpaired image-to-image translation

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

UnpairedImage-to-ImageTranslationwithDensity ChangingRegularization

Neural Information Processing SystemsFeb-11-2026, 13:15:23 GMT

In unpaired image translation setting, we are given two collections of samples without pairing information and we need to learn a proper mapping from one domain to another.

artificial intelligence, machine learning, translation, (17 more...)

Neural Information Processing Systems

Country: North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations

Neural Information Processing SystemsDec-23-2025, 20:06:52 GMT

Score-based diffusion models (SBDMs) have achieved the SOTA FID results in unpaired image-to-image translation (I2I). However, we notice that existing methods totally ignore the training data in the source domain, leading to sub-optimal solutions for unpaired I2I. To this end, we propose energy-guided stochastic differential equations (EGSDE) that employs an energy function pretrained on both the source and target domains to guide the inference process of a pretrained SDE for realistic and faithful unpaired I2I. Building upon two feature extractors, we carefully design the energy function such that it encourages the transferred image to preserve the domain-independent features and discard domain-specific ones. Further, we provide an alternative explanation of the EGSDE as a product of experts, where each of the three experts (corresponding to the SDE and two feature extractors) solely contributes to faithfulness or realism. Empirically, we compare EGSDE to a large family of baselines on three widely-adopted unpaired I2I tasks under four metrics. EGSDE not only consistently outperforms existing SBDMs-based methods in almost all settings but also achieves the SOTA realism results without harming the faithful performance. Furthermore, EGSDE allows for flexible trade-offs between realism and faithfulness and we improve the realism results further (e.g., FID of 51.04 in Cat $\to$ Dog and FID of 50.43 in Wild $\to$ Dog on AFHQ) by tuning hyper-parameters. The code is available at https://github.com/ML-GSAI/EGSDE.

egsde, energy-guided stochastic differential equation, unpaired image-to-image translation, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.76)

Add feedback

EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations

Neural Information Processing SystemsOct-9-2024, 23:31:15 GMT

egsde, energy-guided stochastic differential equation, unpaired image-to-image translation, (3 more...)

Neural Information Processing Systems

Genre: Play > Prospect (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.79)

Add feedback

Unpaired Image-to-Image Translation via Latent Energy Transport

Zhao, Yang, Chen, Changyou

arXiv.org Machine LearningDec-1-2020

Image-to-image translation aims to preserve source contents while translating to discriminative target styles between two visual domains. Most works apply adversarial learning in the ambient image space, which could be computationally expensive and challenging to train. In this paper, we propose to deploy an energy-based model (EBM) in the latent space of a pretrained autoencoder for this task. The pretrained autoencoder serves as both a latent code extractor and an image reconstruction worker. Our model is based on the assumption that two domains share the same latent space, where latent representation is implicitly decomposed as a content code and a domain-specific style code. Instead of explicitly extracting the two codes and applying adaptive instance normalization to combine them, our latent EBM can implicitly learn to transport the source style code to the target style code while preserving the content code, which is an advantage over existing image translation methods. This simplified solution also brings us far more efficiency in the one-sided unpaired image translation setting. Qualitative and quantitative comparisons demonstrate superior translation quality and faithfulness for content preservation. To the best of our knowledge, our model is the first to be applicable to 1024$\times$1024-resolution unpaired image translation.

autoencoder, latent space, translation, (16 more...)

arXiv.org Machine Learning

2012.00649

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Generating large labeled data sets for laparoscopic image processing tasks using unpaired image-to-image translation

Pfeiffer, Micha, Funke, Isabel, Robu, Maria R., Bodenstedt, Sebastian, Strenger, Leon, Engelhardt, Sandy, Roß, Tobias, Clarkson, Matthew J., Gurusamy, Kurinchi, Davidson, Brian R., Maier-Hein, Lena, Riediger, Carina, Welsch, Thilo, Weitz, Jürgen, Speidel, Stefanie

arXiv.org Machine LearningJul-5-2019

In the medical domain, the lack of large training data sets and benchmarks is often a limiting factor for training deep neural networks. In contrast to expensive manual labeling, computer simulations can generate large and fully labeled data sets with a minimum of manual effort. However, models that are trained on simulated data usually do not translate well to real scenarios. To bridge the domain gap between simulated and real laparoscopic images, we exploit recent advances in unpaired image-to-image translation. We extent an image-to-image translation method to generate a diverse multitude of realistically looking synthetic images based on images from a simple laparoscopy simulation. By incorporating means to ensure that the image content is preserved during the translation process, we ensure that the labels given for the simulated images remain valid for their realistically looking translations. This way, we are able to generate a large, fully labeled synthetic data set of laparoscopic images with realistic appearance. We show that this data set can be used to train models for the task of liver segmentation of laparoscopic images. We achieve average dice scores of up to 0.89 in some patients without manually labeling a single laparoscopic image and show that using our synthetic data to pre-train models can greatly improve their performance.

artificial intelligence, machine learning, translation, (17 more...)

arXiv.org Machine Learning

1907.02882

Country: Europe > Germany (0.29)

Genre: Research Report (0.40)

Industry: Health & Medicine > Therapeutic Area (0.70)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.48)

Add feedback

eriklindernoren/Keras-GAN

@machinelearnbotFeb-28-2018, 08:45:01 GMT

Keras implementations of Generative Adversarial Networks (GANs) suggested in research papers. If dense layers produce reasonable results for a given model I will often prefer them over convolutional layers. The reason is that I would like to enable people without GPUs to test these implementations out. These models are in some cases simplified versions of the ones ultimately described in the papers, but I have chosen to focus on getting the core ideas covered instead of getting every layer configuration right. However, because of this the results will not always be as nice as in the papers.

artificial intelligence, implementation, machine learning, (9 more...)

@machinelearnbot

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Unsupervised or Indirectly Supervised Learning (0.64)

Add feedback

junyanz/pytorch-CycleGAN-and-pix2pix

#artificialintelligenceApr-19-2017, 21:25:29 GMT

This is our ongoing PyTorch implementation for both unpaired and paired image-to-image translation. The code was written by Jun-Yan Zhu and Taesung Park. Check out the original CycleGAN Torch and pix2pix Torch code if you would like to reproduce the exact same results as in the papers. More example scripts can be found at scripts directory. To train a model on your own datasets, you need to create a data folder with two subdirectories trainA and trainB that contain images from domain A and B. You can test your model on your training set by setting phase'train' in test.lua.

artificial intelligence, junyanz pytorch-cyclegan-and-pix2pix, machine learning, (14 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.62)

Add feedback